BEVStereo: Enhancing Depth Estimation in Multi-View 3D Object Detection with Temporal Stereo

نویسندگان

چکیده

Restricted by the ability of depth perception, all Multi-view 3D object detection methods fall into bottleneck accuracy. By constructing temporal stereo, estimation is quite reliable in indoor scenarios. However, there are two difficulties directly integrating stereo outdoor multi-view detectors: 1) The construction stereos for views results high computing costs. 2) Unable to adapt challenging In this study, we propose an effective method creating dynamically determining center and range stereo. most confident found using EM algorithm. Numerous experiments on nuScenes have shown BEVStereo's deal with complex scenarios that other stereo-based unable handle. For first time, a approach shows superiority like static ego vehicle moving objects. BEVStereo achieves new state-of-the-art camera-only track dataset while maintaining memory efficiency. Codes been released.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Multi-view Depth Map Estimation Enhancing Temporal Consistency

In this paper, we propose a scheme for multiview depth map estimation to enhance temporal consistency. After we divide the center image into several segments, we estimate one depth value for each segment using 3-D warping and segment-based matching techniques. In the refinement process, we apply a segment-based belief propagation algorithm. In order to enhance temporal consistency and reliabili...

متن کامل

Multi-Stereo 3D Object Reconstruction

We present a method for the reconstruction of a 3D real object from a sequence of high-definition images. We combine two different procedures: a shape from silhouette technique which provides a coarse 3D initial model followed by a multi-stereo carving technique. We propose a fast but accurate method for the estimation of the carving depth at each vertex of the 3D mesh. The quality of the final...

متن کامل

Multi-View 3D Pose Estimation from Single Depth Images

In this paper, we investigate the problem of multi-view 3D human pose estimation from depth images using deep learning methods. We utilize an iterative approach that progressively makes changes to an initial mean pose by feeding back error predictions. Our model is evaluated on a newly collected dataset (ITOP) that contains 30K annotated depth images from top-down and frontal views. Experiments...

متن کامل

Multi-View Depth Map Estimation With Cross-View Consistency

Motivation. Multi View Stereo (MVS) aims to establish 3D models from multiple calibrated images. Some works use region growing to estimate depth map per view, and then merge the results. They either only deal with reliable regions, or have difficulty in parallelizing. More crucially, due to the view-independent estimation, inconsistent outliers may exist and grow during propagation, producing u...

متن کامل

Multi-View Stereo 3D Edge Reconstruction

This paper presents a novel method for the reconstruction of 3D edges in multi-view stereo scenarios. Previous research in the field typically relied on video sequences and limited the reconstruction process to either straight linesegments, or edge-points, i.e., 3D points that correspond to image edges. We instead propose a system, denoted as EdgeGraph3D, able to recover both straight and curve...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Proceedings of the ... AAAI Conference on Artificial Intelligence

سال: 2023

ISSN: ['2159-5399', '2374-3468']

DOI: https://doi.org/10.1609/aaai.v37i2.25234